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(57) Abstract: llie invention relates to methods for mapping and characterizing catalytic domains in enzymes, preferably plant 
enzymes and those enzymes within the carotene synthesis family and more sp>ccifically E-cyclase enzymes regulating formation of 
e,e-carotene. The methods include reverse PCR and site-directed mutagenesis for generating chimera and truncations or site-directed 
mutations of enzymes, respectively. These chimera, truncations or site directed mutants of e-cyclase enzymes are useful in the 
characterization of the sequence residues conferring catalytic domains for the enzymes, and more specifically, the identification of 
single residues regulating catalytic activity for enzymes that are important in plant growth and photosynthesis. Chimeric enzymes 
generated by the methods of the invention can also be used to create transgcnci hosts which arc augmentated in their expression of 
speciflc carotene products. ■ 
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Methods for Determining Ring Number in Carotenoids By Lycopene Epsllon - Cyclases 

and Uses Thereof 

Cross Reference to Related Applications 

This application claims priority under 35 U.S.C. §1 .1 19(e) to provisional 
application serial no. 60/261,473, filed January 12, 2001. The contents of this 
application are hereby incorporated by reference. 
Federally-Sponsored Research 

The research described herein was supported by a grant from the National 
Science Foundation (MCB9631257). The government has certain rights to this 
invention. 

Background of the Invention 
Field of the Invention 

The present invention provides methods for mapping catalytic domains of 
eukaryotic or prokaryotic enzymes of carotenoid biosynthesis and metabolism. The 
present invention also provides polynucleotides encoding chimeric enzymes,, truncated 
enzymes and site-directed mutants of enzymes of carotenoid biosynthesis and 
metabolism and their use in identifying catalytic domains of the enzymes. The invention 
provides methods for obtaining transgenic hosts augmented in their expression of 
bicyclic-e-carotene by transformation with polynucleotide constructs encoding chimeric 
enzymes. The present invention also describes the amino acid sequence of the catalytic 
domain for an s-cyclase which catalyzes lycopene into bicyclic-e-carotene. 
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Description of the Related Art 

Carotenoids with cyclic end groups are present in the photosynthetic reaction 
complexes of plants, algae, and cyanobacteria (1, 2). These llpid-soluble isoprenoid 
pigments protect against photooxidation, harvest light for photosynthesis, and dissipate 
excess light energy absorbed by the antenna pigments (3, 4, 5). The cyclization of the 
linear, pink carotenold lycopene (Figure 1) is a pivotal branch point in the pathway of 
carotenoid biosynthesis in green plants. Two types of cyclic end groups and derivatives 
thereof, are commonly found in carotenoids of plants: p and e rings. These two end 
groups differ only in the position of the double bond within the cyclohexene ring (Figure 
1). Carotenoids with two p rings are ubiquitous (1, 2) and include p-carotene and 
zeaxanthin, pigments thought to serve primarily in protecting against photo-oxidation 
and/or in dissipation of excess light energy. Carotenoids with one p and one £ ring are 
also common in plants and include lutein, the predominant carotenoid in the 
light-harvesting antenna of most green plants. Carotenoids with two t rings (e, s- 
carotene) are not commonly found, other than in trace amounts, in plants and algae (1). 

The symmetrical bicyclic, yellow carotenoid pigment, e,e-carotene, is associated 
with the photosynthetic apparatus in oxygenic photosynthetic organisms and plays a 
vital role in protecting against potentially lethal phdto-oxidative damage. Accordingly, 
these compounds may have widespread industrial applications in promoting plant 
grov^^h and photosynthesis for large-scale agricultural operations, [to modify carotenoid 
colored plants tissues] Epsilon, epsilon-carotene may also have commercial use as food 
dyes and colorings as well as a pharmaceutical use as a chemopreventative agent (31). 
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Romaine lettuce is one of the rare plant species that produces an abundance of 
an s,s-carotenoid, the dihydroxy s.s-carotenoid lactucaxanthin. The present Inventors 
previously described the isolation and characterization of a gene encoding the lycopene 
epsilon cyclase from lettuce which forms B,s-carotene from lycopene, and found that the 
enzyme shares about 65% sequence identity with an Arabidopsis cyclase gene, 
lycopene epsilon cyclase (PCT/US99/10461, which is incorporated by reference herein 
in its entirety). The lettuce enzyme adds two epsilon rings to lycopene to form 8,s- 
carotene, whereas the Arabidopsis enzyme adds only one epsilon ring to form the 
monocyclic 5-carotene, e,\|/-carotene. 

The previously described methods for producing carotenoids with two epsilon 
rings are deficient in identifying the molecular basis for £-cyclase catalytic activity 
effecting bicyclic e-ring additions to lycopene versus the addition of only a single epsilon 
ring. Accordingly, there exists a need in the art for understanding the sequence identity 
of catalytic domains for this family of plant enzymes which are essential to plant growth 
and photosynthesis, and methods for identifying the same. 

With such sequence information at hand, novel enzymes, which participate in the 
formation of s,E-carotene, can be created by replacing portions of one gene from one 
species with an analogous sequence of a related gene from another species. Through 
transfection of host cells with any such chimeric gene constructs or constructs 
containing site-directed mutants of genes, expression of recombinant carotenoid- 
synthesizing enzymes can be used to augment production of s.e-carotene in cells, which 
otherwise produce little or no E,B-carotene. 
Summary of the Invention 
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Accordingly, the present invention provides a metliod for mapping catalytic 
domains of eukaryotic enzymes of carotenoid biosynthesis and metabolism by chimeric 
enzymes. 

The present invention provides a method for mapping catalytic domains of 
eukaryotic enzymes of carotenoid biosynthesis and metabolism by truncation of the 
enzymes. 

The present invention provides a method for mapping catalytic domains of 
eukaryotic enzymes of carotenoid biosynthesis and metabolism by site-directed 
mutation of the enzymes. 

The present invention also relates to polynucleotides encoding chimeric 
enzymes, truncated enzymes and site-directed mutants of enzymes of carotenoid 
biosynthesis and metabolism. 

The present invention provides a method for augmenting the production of e.e- 
carotene in transformed host cells by transfection with constructs encoding chimeric 
carotenoid synthesizing enzymes from different sources. 

The present invention relates to an amino acid sequence for a catalytic domain of 
s-cyclase for catalyzing lycopene into bicyclic-e-carotene. 

Finally, the present invention describes the amino acid sequence for a catalytic 
domain of a lettuce e-cyclase, and permissible amino acid substitutions. 

These and other objects of the present invention have been realized by the 
present inventors as described below. 
Brief Description of the Drawings 



4 



wo 02/061050 PCT/US02/00667 

A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better understood by 
reference to the following detailed description when considered in connection with the 
accompanying drawings, wherein: 

Figure 1 depicts possible routes of synthesis of cyclic carotenoids and some 
common plant and algal xanthophylis (oxycarotenoids) from lycopene. 

Figure 2 describes a PGR strategy for constructing a chimera of lettuce and 
Arabidopsis lycopene e-cyclase cDNAs. 

Figure 3 depicts HPLC elution profiles and absorption spectra of carotenoids 
produced in a lycopene-accumulating E. coli strain (14) in the presence of cDNAs 
encoding a lettuce lycopene e-cyclase (panel A) and an Arabidopsis lycopene e-cyclase 
(panel B). 

Figure 4 shows the alignment of deduced amino acid sequences of Arabidopsis 
(At) and lettuce (Ls) lycopene e-cyclases. Residues identical for both sequences in a 
given position are in white text on a black background. A region of interest is underlined. 

Figure 5 is a schematic illustration of truncated and chimeric lycopene e-cyclase 
cDNAs. 
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Figure 6 depicts tlie catalytic domain for determining the number of e rings added 
to lycopene by Arabidopsis and lettuce lycopene e-cyclase. The catalytic domain was 
mapped to a six amino acid region defined by the residues ALIVQF in the Arabidopsis e- 
cyclase and SHIVLM in the lettuce e-cyclase (see Figures 4 and 5). Deduced amino 
acid sequences of lycopene mono-e-cyclases from tomato (9), marigold (18) and potato 
are also displayed for this region. Similarly conserved residues are shown in black text 
on a gray background. Three amino acid residues in the lettuce bi-e-cyclase that differ 
significantly from those in the known mono-e-cyclases are in white text on a black 
background. Sequences of an Arabidopsis LCYb (a bicyclase introducing two beta 
rings) and an Adonis LCYe of mixed function are displayed below the lettuce LCYe with 
two residues of interest shown in white text on a black background. 

Figure 7 shows a neighbor-joining tree for deduced amino acid sequences of 
plant lycopene (3- and e-cyclases (LCYb and LCYe; 7, 9, 18) and of the related plant 
enzymes capsanthin-capsorubin synthase (CCS; 21) and neoxanthin synthase (NSY; 
22, 23). Reactions catalyzed by the various enzymes are illustrated below the tree. 

Figure 8 depicts an alignment of deduced amino acid sequences of plant p- and 
e-cyclases, neoxanthin synthases (NSY) and capsanthin capsorubin synthase (CCS) 
enzymes, and a cyanobacterial {Synechococcus PCC7942) p-cyclase. 

Detailed Description of the Preferred Embodiments 
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Recently, the present inventors characterized a romaine lettuce gene encoding 
the enzyme, lycopene e-cyclase, which catalyzes the addition of two epsilon rings to 
lycopene forming 8,8-carotene. For the green plant Arabidopsis thaliana, two distantly 
related single copy genes encode enzymes that catalyze the introduction of the p and e 
rings to lycopene (7). The Arabidopsis enzymes share 36% homology at the amino acid 
level. The Arabidopsis lycopene p-cyclase (LCYb) adds two rings to the symmetrical 
lycopene to form the bicyclic p-carotene (p,p-carotene; Figures 1 and 7). On the other 
hand, the lycopene e-cydase (LCYe) of Arabidopsis adds one e ring to form the 
monocyclic (J -carotene (8,\i/-carotene; Figures 1 and 7). These differences in function 
provide a mechanism for adjusting the proportions of p,p-carotenoids that are essential 
for photoprotection or the p,e-carotenoids that serve primarily to capture light energy for 
photosynthesis, while at the same time preventing formation of carotenoids with two e 
rings in Arabidopsis (7, 8, 9). 

These different catalytic properties for enzymes within the related family of 
carotenoid e-cyclase genes from different plant sources, lead the inventors to develop 
different strategies for mapping the catalytic domain of the s-cyclase enzyme from 
lettuce and Arabidopsis as well as potato, tomato, Adonis and marigold (see Figure 6). 
' The catalytic domains were then sequenced in order to identify the amino acid residues, 
which confer the ring number determination for the respective enzymes. 

Accordingly, the present invention provides different methods for mapping 
catalytic domains of eukaryotic or prokaryotic enzymes, more preferably, enzymes 
related to carotenoid biosynthesis and metabolism in plants. 
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A method of the present invention for identifying an enzyme-catalyzing domain in 
a eukaryotic or prokaryotic a carotenoid-synthesizing enzyme, comprises 

a) providing a first polynucleotide encoding a full-length enzyme and a second 
polynucleotide encoding a full-length enzyme, each polynucleotide being sub-cloned in 
tandem into a vector; 

b) providing a first primer for hybridizing to the first polynucleotide and a second 
primer for hybridizing to the second polynucleotide; 

c) performing an inverse polymerase chain reaction using the first and the 
second primer and the vector to obtain a construct containing a chimeric polynucleotide 
containing a 5' end of the first polynucleotide and a 3' end of the second polynucleotide; 

d) repeating steps b) and c) with a plurality of different first primers and a plurality 
of different second primers for obtaining a plurality of constructs containing different 
chimeric polynucleotides for scanning along the encoded amino acid sequence one 
amino acid at a time; 

e) transfecting a host cell with one or more of the plurality of constructs and 
growing the host cell under conditions for expressing chimeric proteins encoded by the 
chimeric polynucleotides; 

f) performing enzyme catalysis with the chimeric proteins on an enzyme-specific 
substrate in the host cell, 

wherein the substrate is preferably, a symmetrical carotenoid such as lycopene; 

and 

g) identifying the enzyme-catalyzing domain encoded by the chimeric proteins by 
identification of at least one carotenoid compound from the enzyme catalysis of step f). 
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The length of the fragment for the first and the second polynucleotide, 
respectively, can vary with any such chimeric construct, and is limited by the primer 
pairs used to generate the construct. Thus, the extent to which either the first or second 
polynucleotide is included in the construct can be used to determine both the position 
and sequence identity for the catalytic domain of the given gene. Examples of chimeric 
gene constructs include a vector containing the first half (5') of a romaine lettuce 
cyclase gene in combination with the second half (3*) of another plant cyclase gene, 
such as the Arabidopsis or potato gene, or the first half of an Arabidopsis or other 
mono-epsilon cyclase gene In combination with the second half (3') of a lettuce cyclase 
gene. In some examples, chimeric constructs were obtained where the catalytic domain 
of one polynucleotide was replaced with that of the other polynucleotide. 

Figure 5 depicts several chimeric polynucleotides generated by the above- 
described method wherein a catalytic domain for one enzyme was replaced with the 
domain from a related gene of a different source. Accordingly, these chimeric constructs 
were used both to map the catalytic domains for the enzymes of the invention and to 
identify the residues responsible for regulating specific enzymatic activity. More 
specifically, the chimeric constructs were designed to identify residues, which confer 
ring number determinants. 

The present inventors have identified the molecular basis for £-cyclase catalytic 
activity effecting bicyclic e-ring additions to lycopene. To this end, various e cyclase 
chimera were constructed from an e-cyclase cDNA from romaine lettuce, a plant known 
to accumulate substantial amounts of a carotenoid with two e rings (10, 11), and an e- 
cyclase cDNA from Arabidopsis, which adds only a single e-ring to lycopene. These 
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chimeric e-cyclases were assayed for their ability to convert lycopene into the bicyclic 
e-carotene in a strain of Escherichia coll engineered to accumulate lycopene. Through 
this approach, a catalytic region for each of the Arabidopsis and lettuce e-cyclases was 
defined as being integral to the determination of ring number. Additionally, by using this 
approach, chimera were obtained wherein the catalytic domain of Arabidopsis was 
switched for the catalytic domain of lettuce and vice versa. 

Sources of enzyme include those eukaryotic and prokaryotic organisms, which 
produce carotenoids including plants, algae, yeasts, fungi, cyanobacteria and other 
photosynthetic bacteria. Preferred plants are lettuce, Arabidopsis, potato, Adonis, 
marigold or tomato. Preferred algae are of the genus dunaliella and haematococcos! 

Enzymes include but are not limited to low abundance, membrane-associated 
enzymes, members of the carotenoid cyclase family as well as enzymes that catalyze 
reactions that utilize symmetrical substrates in the carotenoid pathway such as phytoene 
saturase, beta carotene hydroxylase, epsilon, epsilon-carotene, zeaxanthin . and 
violaxanthin. 

The low abundance, membrane-associated enzymes include phytoene 
desaturase, beta ring hydroxylase, epsilon ring hydroxylase, violaxanthin de-epoxidase 
and beta carotene ketolase. 

The carotenoid cyclase enzymes include p- and e-cyclases, neoxanthin 
synthases, capxanthin capsorubin synthases, and a cyanobacterial {Synechococcus 
PCC7942) p-cyclase. Epsilon-cyclase is a most preferred embodiment. 

Suitable vectors contain a eukaryotic or prokaryotic gene encoding an enzymatic 
domain catalyzing a reaction of carotenoid biosynthesis or metabolism. Alternatively, the 
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vectors contain a chimeric polynucleotide encoding a chimeric enzyme containing an 
enzymatic domain from a related gene from another source. Any such vector contains a 
suitable promoter for the host, and can be constructed using techniques well kno\A^ in 
the art (Sambrool< et aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor. NY, 1989). Suitable vectors for prokaryotic expression 
include pACYC184, pUC119, pBR322 (New England Biolabs, Beverley, MA), pTrcHis 
(Invitrogen), Bluescript SK (Stratagene), pET28 (Novagen) and derivatives thereof. 

The present vectors can additionally contain regulatory elements such as 
promoters, repressors, selectable markers such as antibiotic resistance genes, etc. 

The first and the second primer are designed to hybridize anywhere within the 
full-length sequence for the respective polynucleotide including non-coding regions. 
Primers for obtaining chimeric polynucleotide constructs by an inverse polymerase 
chain reaction as described herein below and depicted in Figure 2, were designed so 
that the first primer recognizing the first polynucleotide hybridizes in the 3'-5' direction 
and the second primer recognizing the second polynucleotide hybridizes in the 5'-3' 
direction. Preferred first primers include but are not limited to SEQ ID NOS: 3. 5, 7, 9, 
11, 13, 15, 17 and 19. Preferred second primers include but are not limited to SEQ ID 
NOS: 4, 6, 8, 10,12, 14, 16, 18 and 20. 

As shown in Table 1 , preferred pairing for the first and second primers includes 
but is not limited to SEQ ID NOS 3 and 4; SEQ ID NOS: 5 and 6; SEQ ID NOS: 7 and 8; 
SEQ ID NOS: 9 and 10; SEQ ID NOS: 11 and 12; SEQ ID NOS: 13 and 14; SEQ ID 
NOS: 15 and 16; SEQ ID NOS: 17 and 18; and SEQ ID NOS: 19 and 20. 
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Host systems according to the present invention can comprise any organism that 
already produces carotenolds or which has been genetically engineered to produce 
carotenoids. Suitable hosts include E. coH, cyanobactefia such as Synechococcus and 
Synechocystis, algae and plant cells. E. coli is a preferred embodiment. 

Transformation of the hosts with the chimeric constructs or vectors of the present 
invention can be done using standard techniques well known in the art, and 
incorporated herein by reference. 

Transformed E. coli can be cultured using conventional techniques. The culture 
broth preferably contains antibiotics to select and maintain plasmids though the 
construct may alternatively be introduced within the genome where no antibiotic 
maintenance is required. Suitable antibiotics include penicillin, ampicillin, 
chloramphenicol, etc. Culturing is typically conducted at 15-45''C, preferably at room 
temperature (16-28°C) for 12 hours to 7 days. E coli cultures are plated and the plates 
are screened visually for colonies with a different color than the colonies of the host E. 
coli transformed with an empty vector. 

The mapping method of the instant invention encompasses identifying catalytic 
domains of enzymes based in the enzyme catalyzing a reaction on a suitable substrate 
and resulting in the formation of a different product or a modified substrate. A preferred 
enzyme and substrate combination is sn^yciase and lycopene, respectively, and the 
result of the enzyme catalysis reaction is the formation of a carotene compound such as 
the introduction of epsilon and beta rings by e-cyclase (7), 

Methods for detecting formation of carotenoid compounds include but are not 
limited to colorlmetric assays, HPLC. TLC, mass spectrometry and absorption 
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spectroscopy. Colorimetric assays, more preferably color complementation assays, 
were used to identify the formation of a different carotenes generated from tlie enzyme 
catalysis reactions of the chimeric enzymes of the invention on a lycopene substrate. 
For confirmation, HPLC and absorption spectroscopy were used to identify the 
carotenoids produced in lycopene-^ccumulating E. coli transfected with different 
chimeric constructs. The above-mentioned assay methods are not limited to detection of 
carotenoids in E. coli, but include any host cell expressing lycopene and capable of 
producing carotenoids, or otherwise transfected with any one of the chimeric constructs 
of the invention. 

Carotenoids include but are not limited to 8,e-carotene, 8,\|/-carotene, or a 
combination thereof, and p,p-carotene. 

In a preferred embodiment, the addition of epsilon cyclic end groups to the pink- 
colored lycopene results in the formation of products that are yellow or yellow-orange in 
color. Therefore, the functioning of the epsilon lycopene cyclase of the invention was 
detected by a change in the color of E. coli cultures that accumulate lycopene. 

Another method of the present invention for identifying an enzyme-catalyzing 
domain in a eukaryotic or prokaryotic e-cyclase enzyme, comprises 

a) providing a vector containing a polynucleotide encoding the full-length enzyme 
and a primer for hybridizing to the polynucleotide; 

b) performing site-directed mutagenesis using the primer and the vector for 
obtaining a construct containing a truncated polynucleotide encoding a fragment of the 
enzyme; 
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c) transf acting a host cell with the construct and growing the host cell under 
conditions for expressing a truncated protein encoded by the truncated polynucleotide; 

d) allowing enzyme catalysis with the truncated protein on an enzyme-specific 
substrate in the host cell, 

wherein the substrate is preferably, lycopene or another symmetrical substrate of 
the pathway of carotenoid synthesis and metabolism; and 

e) identifying the enzyme-catalyzing domain encoded by the truncated protein by 
formation of a carotenoid compound from the enzyme catalysis of step d). 

Another method of the present invention for identifying an enzyme-catalyzing 
domain in a eukaryotic or prokaryotic carotenoid-synthesizing enzyme, comprises 

a) providing a vector containing a polynucleotide encoding the full-length enzyme 
and a primer for hybridizing to the polynucleotide; 

b) performing site-directed mutagenesis using the vector and the primer for 
obtaining a construct containing a site-directed mutant of the polynucleotide encoding 
the enzyme; 

c) transfecting a host cell with the construct and growing the host cell under 
conditions for expressing a site-directed mutant of a protein encoded by the site- 
directed mutant of the polynucleotide; 

d) allowing enzyme catalysis of the site-directed mutant of the protein on an 
enzyme-specific substrate in the host cell, 

wherein the substrate is preferably, lycopene; and 



14 



wo 02/061050 PCT/US02/00667 

e) identifying the enzyme-catalyzing domain encoded by the site-directed mutant 
of the protein by formation of a carotenoid compound from the enzyme catalysis of step 
d). 

Primers used for obtaining truncated polynucleotide constructs or constructs 
containing site-directed mutants for polynucleotides encoding the enzymes of the 
invention include those described herein below. Preferred primers include but are not 
limited to SEQ ID NOS: 22^0. 

The above-described methods were used to identify sequence determinants 
regulating catalytic activity in the preferred enzymes encoded by the polynucleotides of 
the invention. 

More preferably, these methods were used to identify an internal region of six 
amino acid residues (underlined In the alignment of Figure 4) regulating ring number 
determination for the preferred enzymes of the invention. In a preferred embodiment, 
the six amino acid segment Implicated in determination of ring number for the lettuce 
and Arabidopsis LCYe is displayed in Figure 6. 

With respect to each of the 6 amino acid residues for the catalytic domain of the s- 

cyclase enzyme family, 

the first amino acid position of the 6 amino acids can be alanine (A), serine (S), 
glutamic acid (E) or asparagine (D), and is preferably S for an enzyme introducing two 
epsilon rings; 

the second amino acid position of the 6 amino acid region can be arginine (R), 
leucine (L), histidine (H) or isoleucine (1), and is preferably H or R for an enzyme 
introducing two rings; 
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the third amino acid position of the 6 amino acid region can be isoleucine (I) or 
leucine (L), and Is preferably I for an enzyme introducing two rings; 

the fourth amino acid position of the 6 amino acid region can be valine (V) or 
leucine (L), and is preferably V for an enzyme introducing two rings; 

the fifth amino acid position of the 6 amino acid region can be glutamine (Q), 
leucine (L) or lysine (K), and is preferably L for an enzyme introducing two rings; and 

the sixth amino acid position of the 6 amino acid region can be phenylalanine (F), 
leucine (L). methionine (M) or leucine (L), and is preferably M for an enzyme introducing 
two rings. 

The amino acid residues for the catalytic region of the lettuce e-cyclase (lettuce 
LCYe) producing 8,s-carotene are SHIVLM (SEQ ID NO: 41) or SRIVLM (SEQ ID NO: 
42). 

The amino acid residues for the catalytic region of the Arabidopsis e-cyclase 
(Arabidopsis LCYe) producing 8,\i/-carotene are ALIVQF (SEQ ID NO: 43). 

The amino acid residues for the catalytic region of the potato e-cyclase (potato 
LCYe) producing e,\|/-carotene are ALILQL (SEQ ID NO: 44). 

The amino acid residues for the catalytic region of the tomato e-cyclase (tomato 
LCYe) producing 8,\|/-carotene are ALILQL (SEQ ID NO: 45). 

The amino acid residues for the catalytic region of the marigold e-cyclase 
(marigold LCYe) producing e.v-carotene are ALIVQM (SEQ ID NO: 46). 

The amino acid residues for the catalytic region of the Adonis e-cyclase (Adonis 
LCYe1) producing e,\|/- and e,e-carotene are ELIVQL (SEQ ID NO: 47). 
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Finally, the amino acid residues for the catalytic region of the Arabidopsis p- 
cyclase (Arabidopsis LCYb) producing p,p-carotene are DILLKL (SEQ ID NO: 48). 

These methods were use to identify a single amino acid that determines whether 
the monocyclic -J -carotene or the bicyclic e-carotene is produced from lycopene through 
the action of the Arabidopsis and lettuce e-cyclases. More preferably, for most efficient 
synthesis of two epsilon rings, Arabidopsis or other monocyclic epsilon cyclase catalytic 
domains would be entirely replaced with a catalytic domain of lettuce comprising SHIVLM 
(SEQ ID NO: 41) or SRIVLM (SEQ ID NO: 42). Polynucleotides containing site-directed 
mutations in the region encoding the catalytic domain for Arabidopsis e-cyclase (LCYe) 
were constructed by replacing single amino acid residues for each of the naturally 
occurring residues of this enzyme. The inventors determined that conversion of lycopene 
into a bicyclic e-carotene is primarily dependent on the second amino acid position of the 
6 amino acid catalytic region of the Arabidopsis and lettuce e-cyclase enzyme being H or 
R. 

The relatedness of two carotenoid cyclase enzymes of differing functionality has 
allowed the Inventors to identify an amino acid residue that determines the distinctive 
properties of each. The success of this discovery demonstrates the utility of a domain 
swapping approach for identification of regions and residues of importance in the 
catalytic properties of a low abundance, membrane-associated enzyme (see also 29, 30 
for use of a similar approach for soluble enzymes). This approach holds promise for 
identifying regions and residues integral to the functioning of other members of the 
carotenoid cyclase family (Figure 7). 
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Transgenic organisms can be constructed which include the polynucleotides of the 
invention by methods described in PCT/US99/10461, and herein incorporated by 
reference in its entirety. The Incorporation of chimeric polynucleotides containing the 
catalytic domain of one gene from one species inserted into the analogous sequence of a 
related gene from another species, can allow the controlling of carotenoid synthesis, 
content or composition in the host cell. In another approach, polynucleotides containing 
site-directed mutations in the catalytic domain of the e-cyclase gene can also be 
transfected into host cells. More preferably, organisms which do not otherwise produce 
8,8-carotene such as Arabidopsis, can be transfected with the lettuce enzyme, a chimeric 
enzyme containing the lettuce catalytic domain or a modified form of the endogenous 
epsilon cyclase gene. Alternatively, mutations in the catalytic domain of the endogenous 
gene may be introduced and selected. 

Any of the chimeric constructs of the invention can be used to increase the 
amount of bicyclic e-carotene in an organism and thereby alter the nutritional value, 
pharmacology or visual appearance of the organism. In addition, the transformed 
organism can be used in the formulation of therapeutic agents, for example treatment of 
cancer (32, 33). 

Appropriate transgenic hosts include plants such as marigold, tomato, pepper, 
banana, potato, and the like. Essentially any plant is suitable for expressing the preferred 
chimeric enzyme constructs, but the preferred plants are those which already produce 
high levels of carotenoids, and those which are normally ingested as foods or used as a 
source of carotenoid pigments. In particular, plants which bear fruit can be genetically 
manipulated to provide tissue-specific expression in fruit. Marigold is a particulariy 
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preferred host, because it can be used as a "bioreactor" for bulk production of 
carotenoids, and is actually grown commercially as a carotenoid source for chicken feed. 
For expression in marigold, a promoter can be used which is flower-specific. Another 
preferred transgenic plant is tomato, because this plant already produces high levels of 
lycopene, and it has been reported that there is a correlation between consuming 
tomatos and decreased incidence of colon cancer. Also, preferred are plants wherein the 
visual appearance is characterized by accumulation of carotenoids wherein the color 
properties could be changed by introduction of a modified epsllon cyclase. 

The objects of the invention have been achieved by a series experiments, some of 
which are described by way of the following non-limiting examples. 

Examples 

Example 1 . Plant e-cyclase cDNAs. 

A Lactuca sativa var. romaine (romaine lettuce) leaf cDNA library was obtained from Dr. 
Harry Yamamoto (12). An Adonis aestivalis var. palaestina flower cDNA library has been 
described (13). The libraries were screened for cDNAs encoding lycopene cyclases by 
functional "color" complementation in Eschericliia coli as previously described (7). A 
Solanum tuberosum (potato) e-cyclase cDNA (GenBank accession 827545) was 
obtained from Dr. Nicholas J. Provart, Institut fuer Genbiologische Forschung, Berlin, 
Germany. Nucleotide sequences of the various cDNAs, and of chimera and mutants of 
these (see below), were determined by the DNA Sequencing Facility of the Center for 
Agricultural Biotechnology at the University of Maryland. 
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Example 2. Construction and characterization of chimeric and truncated 8-cyclases, and 

site-directed mutants thereof. 

A, Synthesis of chimera by inverse PCR 

An inverse PCR strategy (Figure 2) was developed to create chimera of the lettuce and 
Arabidopsis E-cyciase cDNAs. Plasmids containing both cDNAs, in tandem and in the 
same orientation, were first constructed. Plasmid templates were linearized by digestion 
with an appropriate restriction enzyme (6/pl for plasmid pLse/Ate, see Figure 2; Nco\ for 
pAte/Lse). PCR was performed in 100 pi tubes in an MJ Research PTC-1 50-25 
MiniCycler with heated lid. The reaction volume of 50 |jl contained 100 ng of template 
DNA and 2.5 units of Platinum Ffx DNA polymerase (Life Technologies). Concentrations 
of primers, dNTPs, magnesium, and buffer components were as suggested in the 
manufacturer's protocol. Oligonucleotide primer combinations used to construct the 
various chimera are listed in Table 1 . 



Table 1. Oligonucleotides for construction of chimeric and truncated 8-cycIases and site- 
directed mutants 


Construct 


Primer Name and 
Junction 


'Primer Sequence 


Lse/Ste 262/108 


(none) ELGG/PRVS 


conserved Avail sites: GGWCC 

(SEQ ID NO: 1) 


Lse/Ste 285/6 


(none) YDPD/LGLQ 


conserved BglU sites: AGATCT 

(SEQ ID NO: 2) 


Lse/Ate 323/320 


Lse323N KIFF/EETC 


cgaagaatatttttgttggagaca (SEQ ID NO: 3) 


Ate324C 


aggagacatgtttggcctcaa (SEQ ID NO: 4) 


Lse/Ate 443/435 


Lse443N WPLE/RKRQ 


ttcaagtggccacaatgtttc (SEQ ID NO:. 5) 


Ate444C 


aggaaaagacagagagcattctt (SEQ ID NO: 6) 


Lse/Ate 461/453 . 


Lse4 6lN IVLM/DTEG 


attagcacgatgtgtgatagtcc (SEQ ID NO: 7) 


Ate462C . 


ggataccgaaggcattagaag (SEQ ID NO: 8) 


Lse/Ate 498/490 


Lse498N IIFA/LYMF 


gcaaatattatcaaatccgttgaaga (SEQ ID NO: 9) 


Ate499C 


tttatacatgttcgtcatttcaccaaa (SEQ ID NO: 10) 


Ate/Lse 390/395 


Ate394N ATGY/SWR 


tagcctgttgcgggatgta (SEQ ID NO: 11) 


Lse395C 


ttcagttgttcgatctttgtcag (SEQ ID NO: 12) 


Ate/Lse 408/413 


AteN412 VIAE/ILRQ 


tctgcgatgactgatgcatatt (SEQ ID NO: 13) 
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LseC314 (typo by 
vendor) 


gattttaagacaagatcaatctaaagagatg 

(SEQ ID NO: 14) 


Ate/Lse 420/430 


AteN429 QINS/NISK 


ctgttgatctgtttggtagtctcttc (SEQ ID NO: 15) 




LseC430 


taacatttcaaaacaagcatgggaaa (SEQ ID NO: 16) 


Ate/Lse 440/450 


AteN449 RQRA/FFLF 


atgctctctgtcttttcctttctg (SEQ ID NO: 17) 




LseC450 


tctttctattcggactatcacacatc (SEQ ID NO: 18) 


Ate/Lse 455/465 


AteN464 FDTE/GTRT 


cttcggtatcgaattgaactatgag (SEQ ID NO: 19) 




LseC465 


gaacacgtacatttttccgtactttc (SEQ ID NO: 2 0) 


Ate 


(noxie ) 


Full length cDNA in pBluescript SK-; not a 
LacZ fusion (SEQ ID NO: 21) 


TV 4- A A 1 T XT 

Ate AJLxn 


{Ncol) 


caAcCatggcggtttcaacatttcc (SEQ ID NO: 22) 


Ate a58N 


EPS-DETTA59 
(Ncol ) 


tgtgCCATggtgagagaagatttcgctgac 

(SEQ ID NO: 23) 


Ate a82N 


EPS-delta82 
(Ncol) 


ttcCCatgGagcagaacaaagatatggatga 

(SEQ ID NO: 24) 


Ate a88N 


EPS-delta88 
(Ncol) 


aagCCatggatgaacagtctaagcttgttg 

(SEQ ID NO: 25) 


Ate A103N 


(none) 


Used Munl/mung bean + fused to Ncol /Kl enow 


Ate a23C 


Ate/K511tag 
(=taa) 


gatgattgatgagaccttAtctcaaattgtttggtg 

(SEQ ID NO: 27) 


Ate AIiIVQF447- 
52/SHIVLM 


Ate/456-61SHIVLM 

(Dralll) 


cjccttcgcytatcCaTtAgCacGatgTgtgAaagaccaaagag 

(SEQ ID NO: 28) 


Ate L448H 


AteL457H/FspI 


qatatcgaattgaactatgTgCgcaagaccaaagagaaag 

(SEQ ID NO: 29) 


Ate L448D 


AteL457D/EcoRV 


aatatcaaattgaacGatATCtgcaagaccaaagagaaag 

(SEQ ID NO: 30) 


Ate L448R 


AteL457R/BssHII 


ggtatcgaattgaactatgCgCgcaagaccaaagagaaag 

(SEQ ID NO: 31) 


Ate A447D 


AteA456D/BglII 


cgaattgaactatgagATcTagaccaaagagaaagaatgc 

(SEQ ID NO: 32) 


Lse 


(none) 


E\ill length cDNA in pBluescript bis.— , noTi 
a lacZ fusion (SEQ ID NO: 33) 


Lse a92N 


(none) 


Used pre-existing Ncol; fusion in TrcHisA 

(SEQ ID NO: 34) 


Lse A107N 


(none) 


Used Munl/mung bean + fused to Ncol/Klenow 
of pTrcHisA (SEQ ID NO: 35) 


Lse a23C 


Lse/M511tag 


atgtctaaccagttccTAtctcaagctgtgaggtgc 

(SEQ ID NO: 36) 


Lse H457L 


Lse/H457L 


ccctctagatccattagcacgatgAgtgatagtccgaataga 
aag (SEQ ID NO: 37) 


Lse H457D 


LseH457D/EcoRV 


cctctaaatccattagcacgatAtCtgatagtccgaatagaa 
ag (SEQ ID NO: 38) 
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Lse H457R 


Lse/H457R/XhoI 


ctaaatccattagcacgatTCTCgaGagtccgaatagaaaga 
agg <SEQ ID NO: 39) 


Lse L460Q 


Lse/L46PQ 


gttccctctagatccattTgcacgatgtgtgatag 

(SEQ ID NO: 40) 


* Bases altered or introduced by piimecs are in bold uppesrcase letters. Restrictioa enzyme sites introduced by primers 
are underlined. 



Typical cycling parameters were: 94 **C for 3 min, fifteen cycles of 94 755/ 68 
for 20/60/360 sec, ten cycles of 94 /55/ 68*^0 for 20/60/360 + 15 additional sec each 
cycle, 68**C for 10 min, and hold at 4*^0. PGR products were purified by gel 
electrophoresis (0.8% SeaKem GTG agarose; FMC BioProducts) and recovered from 
gels using the GENECLEAN kit (BiolOI, Inc.). The ends of the recovered PGR products 
were phosphorytated with T4 polynucleotide kinase (New England BioLabs, Inc.; 5 units 
of enzyme and one-half of the recovered PGR product in a final volume of 10 pi) with 
incubation at 37 ''C for 30 min, and the reactions were then cooled on ice. T4 DNA ligase 
(0.5 pi containing 200 NEB units; New England BjoLabs) was then added, and samples 
were incubated for 12-16 h at 15 **G. One pi of each ligation mixture was used to 
transform chemically competent E. coli (25 pi of XL10 Gold cells; Stratagene Gloning 
Systems, Inc.), and the transformation mixture was plated on a single large (15 cm) 
Luria-Bertani (LB) agar (1.5%, w/v) plate containing 150 pg/ml ampicillin (sodium salt). 
The resulting colonies (typically several thousand) were collected and combined in 5-10 
ml LB medium. The plasmids were purified and transformed Into a pink colored, 
lycopene-accumulating E. coli strain (14) for analysis. Pigments were extracted and 
analyzed from several of the resulting yellow colonies (i.e., a yellow color is indicative of 
an active cyclase). Usually more than 50% of the colonies were yellow. The plasmid from 
one of the colonies was recovered, and the nucleotide sequence was determined to 
verify the construct. 
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An Avail site located in the same relative position in the lettuce and potato 
e-cyclase cDNAs was used to construct a chimeric lettuce/potato e-cyclase In which the 
first 262 amino acids of the encoded polypeptide derived from the lettuce cDNA and the 
subsequent 272 were specified by the potato cDNA. The plasmid containing this chimeric 
cDNA Is referred to as pLse262/Ste108. The product of this chimeric cDNA converted 
lycopene to <y -carotene in E. coll (data not shown) thereby indicating that the potato 
enzyme is a mono-e-cyclase. 

B. Synthesis of truncations for and site-directed mutants of e-cyclases by site-directed 
mutagenesis. 

N-terminal truncations of cDNAs encoding the Arabidopsis and lettuce lycopene e- 
cyclases were created using restriction sites or by using the CHAMELEON™ 
Double-Stranded, Site-Directed Mutagenesis Kit (Stratagene Cloning systems, Inc.) to 
introduce a A/col site at the position desired for the initiation codon. The resulting product 
was excised and inserted into the Nco\ site of plasmid vector pTrcHisA (Invltrogen, Inc.), 
downstream of and in frame with the inducible Ttv promoter. C-terminal truncations were 
created by introducing termination codons at the desired positions. Various other site- 
specific mutations, usually accompanied by the introduction of a restriction site to 
facilitate the identification of mutants, were also created with the CHAMELEON™ kit. 
Primers (SEQ ID NOS: 22-40) used to introduce the various mutations are listed in Table 
1 . Mutations were confirmed by analysis of the nucleotide sequence. 

Example 3. Mapping and identification of catalytic domains by analysis of chimera and 
truncations for e-cyclase. 
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Carotenoids with two e-rings are uncommon in plants. A notable exception is 
romaine lettuce, where lactucaxanthin (e,e-carotene-3,3'-diol) comprises as much as 
21% (mol/mol) of the total carotenoid pigment in the leaves (10. 11). The inventors 
selected twenty-six prospective lycopene c-cyclase cDNAs in a screen of a romaine 
lettuce leaf cDNA library. Other than in length, the cDNAs appeared to be identical. The 
complete nucleotide sequence of the longest cDNA was ascertained, and a plasmid 
construct. pDY4 containing a subcloned cDNA, was introduced into a 
lycopene-accumulating strain of E. coli for analysis of the activity of the encoded 
enzyme. 

Accumulation of carotenoids with two e rings in lettuce is known to occur in vivo 
(10, 11). Introduction of the lettuce e-cyclase cDNA into the lycopene-producing E. coli 
yielded predominantly (>90%) £-carotene as indicated by the HPLC retention time 
(Figure 3. panel A) and absorption spectrum (Figure 3, panel D) of the major product. In 
marked contrast, introduction of the Arabidopsis LCYe into lycopene-accumulating E. 
coli yielded, as earlier reported (7), approximately 98% of monocyclic S -carotene for 
the total amount of carotenoid produced (Figure 3, panel B; absorption spectrum in 
panel E). The HPLC elution profile of a lycopene-accumulating control culture (Figure 3, 
panel C), and the absorption spectrum of the major compound in this strain (lycopene; 
Figure 3, panel F) are also displayed for comparison. Elution times were ca. 12.2 min 
for lycopase (\|/,\|/-carotene), 14.3 min for 5-carotene (8,\i/-carotene), and 17.1 min for e- 
carotene (E,E-carotene). 

Tl-ie Arabidopsis and lettuce lycopene e-cyclases are substantially similar in their 
deduced amino acid sequences (ca. 77 % overall identity; Figure 4 (the alignment was 
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created using Clustal X version 1.8 (17). GenBank accession numbers are listed at the 
ends of the sequences. Asterisks above the alignment are spaced every 10 residues. 
Numbers to the right denote the number of the amino acid residue that ends the row)) 
and closely resemble other known LCYe (Figure 8). 

Specific amino acid differences at the N-terminus of the e-cyclases were initially 
thought to be involved in determination of ring number. The lettuce and Arabidopsis 
£-cyclase cDNAs were therefore modified so as to produce polypeptides truncated at 
the N terminus. Carotenoids that accumulate in an E. coli strain containing the indicated 
cDNA subcloned into plasmid vector pBluescript SK-, and that otherwise accumulates 
only lycopene (ip,vjj-carotene), are indicated to the right. Only the predominant 
carotenoid (>90% of the total in all cases) is listed. Solid black vertical lines connecting 
the Arabidopsis and lettuce cyclases at the top of the figure indicate identically- 
conserved amino acid residues. LsE/AtE 323/320 defines a chimera consisting of the 5' 
portion of the lettuce e-cyclase cDNA up to and including nucleotide bases specifying 
amino acid residues 323 and the 3' portion of the Arabidopsis e-cyclase cDNA 
beginning with nucleotide bases that encode amino acid residue 320 and proceeding to 
the end of the cDN A. 

Substantial portions of the N-termini of the lettuce and Arabidopsis e-cyclases 
were found to be nonessential to catalytic function (Figure 5), but all truncations that 
yielded an active enzyme did not alter the mixture of products produced from lycopene in 
E. coli (Figure 5). 
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C-terminal truncations were also constructed, and for those truncations wliere 
even a relatively small portion of the polypeptide was deleted, e-cyclase enzyme activity 
in E. coll was completely eliminated (Figure 5), 

These catalytic regions were further characterized using a series of chimeric 
cDNAs encoding portions of both the lettuce and Arabidopsis LCYe. The chimeric cDNAs 
were constructed by an inverse PCR-based method (Figure 2) to minimize constraints on 
the choice of the chimera junction. Construction of the chimeras, and the activity of their 
respective polypeptide products are presented in Figure 5. Constructs for the lettuce 
bi-e-cyclase with the N-termlnal portion and the Arabidopsis bi-e-cyclase for the C- 
terminal portion of the chimeric cDNAs are shown. Characterization of these chimeric 
lettuce/Arabidopsis e-cyclases defined a region of six amino acids (underlined in the 
alignment of Figure 4) that is involved in ring number determination. These initial 
experiments did not rule out that other amino acids elsewhere in the polypeptides might 
also influence the ring number. A second series of chimeras using the Arabidopsis 
cDNA as the N-terminus and the lettuce cDNA as the C-terminus, identified the same 6 
amino acid domain as conferring ring number determination (Figure 5). 

Amino acids within this region and in the context of the rest of the amino acid 
sequence are able to confer whether the enzyme adds one or two rings to lycopene. The 
six amino acid segment implicated in determination of ring number is displayed in Figure 
6 for the lettuce and Arabidopsis LCYe. 

Similarly conserved residues are shown in black text on a gray background. 
Three amino acid residues in the lettuce bi-e-cyclase that differ significantly from those in 
the known mono-e-cyclases are in white text on a black background. Sequences of an 
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Arabidopsis LCYb (a bicyclase) and an Adonis LCYe of mixed function are displayed 
below the lettuce LCYe with two residues of interest shown in white text on a black 
bacl<ground. Similarity was defined according to the Blosum 45 scoring matrix (19): DE, 
NH, ST, QKR, FYW, LIVM). GenBank accession numbers: Adonis LCVel, AF321535; 
Arabidopsis LCYb, U50739; Arabidopsis LCYe, U50738; lettuce LCYe, AF321538; 
marigold LCYe, AF251016; potato LCYe, AF321537; tomato LCYe, Y14387. 

For comparison, sequences in this region for other known mono-e-cyclases are 
also displayed. The nucleotides that specify these amino acids in the Arabidopsis 
e-cyclase (ALIVQF) were replaced with those that specify the amino acids of the lettuce 
£-cyclase (SHIVLM). The enzyme produced by this cDNA (mutant AtE 
ALIVQF447-52SHIVLM) functions even better than the lettuce e-cyclase (Table 2), 
confirming that determination of ring number is influenced by one or more of the amino 
acids in this small region of the polypeptide. 

Within the six amino acid region mapped by the chimeric lettuce and Arabidopsis 
e-cyclases (Figure 5), only four residues differ between the two sequences (Figure 6). Of 
these four differences, the residue at position M461 of lettuce vs. the residue at position 
F452 of Arabidopsis is likely unimportant because it is a conservative replacement and 
because the marigold mono-e-cyclase (18) also has an M residue in this position (Figure 
6). The residue in position H457 of lettuce (vs. residue in position L448 in Arabidopsis) 
and the residue in position L460 of lettuce (vs. residue in position Q451 in Arabidopsis) 
are the most conspicuous differences relative to the sequence of the Arabidopsis LCYe. 

The inventors have shown that the change of a single amino acid in the 
polypeptide sequences of the Arabidopsis and lettuce lycopene e-<:yclases has a 
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profound influence on the ability of these enzymes to add a second e-ring to the 
symmetrical substrate lycopene. The gain of function engendered in the Arabidopsis 
LCYe mutants L448H and L448R and the importance of this specific amino acid residue 
are all the more compelling when contrasted against the loss of function in the lettuce 
LCYe mutant H457L. This single amino acid at position 448, thus regulates molecular 
switching for ring number determination by lycopene £-cyclases. 

The lycopene e-cyclases are members of an extended family of carotenoid 
modifying enzymes (Figure 7) that includes capsanthin-capsorubin synthase (CCS; 21) 
and the recently identified neoxanthin synthase (NSY; 22, 23), as well as lycopene 
P-cyclase (LCYb; 7, 1 8. 20, 26). 

A lycopene p-cyclase from the cyanobacterium Synechococx:us PCC7942 (14) 
was used as the outgroup. Branch lengths are drawn to scale. Bootstrap values greater 
than 50% for 10,000 replicates with a seed value of 111 are Indicated. The analysis 
encompassed 398 positions, beginning with the initiating Met of the Synechococcus 
cyclase, and excluded those positions with gaps in the alignment. The amino acid 
sequence alignment and GenBank accession numbers for the nucleotide sequences are 
shown in Figure 8. The method of Saitou and Nei (24) was used to construct the tree. 
Distances were corrected for multiple substitutions (25). 

LCYb and CCS each act at both ends of their respective symmetrical substrates, 
while NSY acts at only one end of the symmetrical violaxanthin. The known plant LCYb 
do not contain a basic residue in the position corresponding to H457 of the lettuce LCYe; 
instead they contain the nonpolar I residue (Figures 6 and 8). The ability of LCYb to add 
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two p rings to lycopene must, therefore, derive from an alternative solution to that which 
confers a bicyclase activity to the lettuce LCYe. 

A more complete conversion to e-carotene by Arabidopsis LCYe mutant 
ALIVQF44742SHIVLM (98% e-carotene) was observed compared to the mutant L448H 
(92% e-carotene; Table 2). This indicated that the preceding amino acid residue (a 
nonpolar A447 in Arabidopsis vs. a polar S456 of lettuce) influences ring number 
determination. The known plant p-cyclases contain an acidic residue (D) In this position. 
Two closely-related Adonis aestivalis e-cyclase cDNAs (Figure 6) also specify an acidic 
amino acid (E) in this position, but do not othenA^ise differ significantly from 
mono-e-cyclases in this region (Figure. 6). The Adonis LCYe produce a preponderance of 
e-carotene in lycopene-accumulating E. coli (Table 2). However, conversion of the A447 
residue of the Arabidopsis LCYe to a D did not yield a bicyclase (Table 2). indicating that 
the identity of this residue does not, by itself, determine ring number. 

Lycopene is a symmetrical, nonpolar C40 hydrocarbon (Figure 1) that is insoluble 
in aqueous solutions and accumulates in membranes and oil bodies of plant cells. There 
is considerable uncertainty regarding the orientation of lycopene and other carotenoids in 
the plane of the membrane (see 27 for a discussion), and also of the position of the 
cyclase enzyme within or on the surface of the membrane (7, 26). If lycopene spans (i.e, 
is perpendicular to the plane of the membrane) or partially penetrates the membrane, 
then the two ends of the molecule will almost certainly not be equally accessible to the 
cyclase. There is experimental evidence that the two ends of p-carotene differ in 
accessibility to the hydroxylase enzyme that converts this compound to zeaxanthin 
(P,p-carotene-3,3'-diol; 28). As was suggested for the hydoxylase enzyme (26, 28), the 
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addition of two rings to lycopene may depend on an ability of the cyclase to form dimers, 
whereby binding of the more accessible end of the substrate by one of the subunits 
would serve to bring the other end of the carotenoid molecule into proximity of the 
cognate subunit where catalysis could then proceed. The region encompassing the L447 
residue of the Arabidopsis LCYe might then constitute an interfacial surface that 
mediates subunit interaction. 

Example 4. Identification of amino acid residues conferring £-cyclase activity. 

Plasmids containing individual e-cyclase cDNAs, chimera or site-directed mutants 

were transformed into lycopene-accumulating E. coli strain TOP10 (14). Cultures in six 

ml LB medium containing 150 pg/ml ampicillin and 30 ng/ml chloramphenicol were grown 

for 1 day with shaking in darkness at 28*^0 as described previously (15). Cells were 

harvested by centrifugation and pigments were extracted and analyzed by HPLC 

essentially as described previously (7, 16), except that an isocratic mobile phase of 40% 

B was used for the analysis. Pigments were identified on the basis of absorption spectra 

and HPLC retention times relative to standard compounds. 

Table 2. Activity of Eycopene e-cyclases and site-directed mutants 
with lycopene (\|/,\|/-carotene) as substrate in E, coli 

s-CyclasecDNA Mutation *Carotenoids 

lyc : del : eps 

(none) - 100 : 0 : 0 

Arabidopsis (AtE) wild type (y2) 1 : 98 : 1 
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AtE ALIVQF447-52SHrVLM 0 : 2 : 98 

AtE L448H 0 : 8 : 92 

AtE L448R 0 : 8 : 92 

AtE L448D 37 : 56 : 8 

AtE A447D 1 : 98 : 1 

Lettuce (LsE) wUdtype(DY4) 3 : 8 : 90 

LsE H457R 3 : 6 : 91 

LsE H457D 22 : 18 : 60 

LsE H457L 17 : 73 : 10 

Adonis (AaEl) wild type (Ad3) 0 : 44 : 56 



*Carotenoicls accumulated in a strain of E. coli that also contains the 

plasmid pAC-LYC (14) and thereby nomially accumulates lycopene. 

Abbreviations: lyc, lycopene ; del, D-carotene; eps, D-carotene. 

Data given as mol : mol : mol. 

The activity of a lettuce L460Q mutant did not differ significantly 
from that provided by the wild type lettuce cDNA. The lettuce H457L 
mutant, In contrast, exhibited an activity comparable to that of the 
Arabidopsis enzyme: ^-carotene was the predominant product 
accumulated in E. coli. Conversely, the corresponding Arabidopsis 
mutant, L448H, gained the ability to produce e-carotene as the 
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predominant product. Thus, the identity of the amino acid residue within 
this single position of the lettuce and Arabidopsis sequences specifies 
whether a monocyclase or bicyclase activity results. 

The Arabidopsis L448 and lettuce H457 were also changed to D 
and R residues in order to gain insight as to what properties of the residue 
in this position influence the determination of ring number For both the 
Arabidopsis and lettuce e-cyclases, conversion to an R, like H a positively 
charged residue, gave results essentially identical to those obtained with 
an H codon at this position (see AtE L448R and LsE H457R). Conversion 
to D (AtE L447D and LsE H457D), a negatively charged residue, greatly 
impaired the overall activity of the enzymes (i.e. a substantial proportion of 
the substrate lycopene remained) and reduced, though did not eliminate, 
formation of £-carotene. 
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WHAT IS CLAIMED: 

1. A chimeric polynucleotide encoding a carotene-synthesizing enzyme, comprising a 
polynucleotide encoding an N-terminal portion of a first enzyme and a C-terminal portion 
of a second enzyme. 

2. A chimeric polynucleotide encoding a bicyclic-epsilon-carotenoid synthesizing 
enzyme, comprising a polynucleotide encoding an N-terminal portion of a first enzyme 
and a C-terminal portion of a second enzyme. 

3. A chimeric polynucleotide encoding a bicyclic-e-carotene-synthesizing enzyme, 
comprising a polynucleotide encoding an N-terminal portion of a lettuce enzyme and a 
C-terminal portion of an Arabidopsis enzyme. 

4. A chimeric polynucleotide encoding a bicyclic-e-carotenersynthesizing enzyme, 
comprising a polynucleotide encoding an N-terminal portion of an Arabidopsis enzyme 
and a C-terminal portion of lettuce enzyme. 

5. The chimeric polynucleotide as in one of claims 1-4, wherein the enzyme is an e- 
cyclase. 

6. The chimeric polynucleotide as in one of claims 1-4, wherein the enzyme has a 
catalytic domain comprising six amino acids. 
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7. The chimeric polynucleotide of claim 6, wherein 

the first amino acid residue of the six amino acids is alanine (A), serine (S), 
glutamic acid (E) or asparagine (D); 

the second amino acid position of the six amino acids is arginine (R), leucine (L), 

histldine (H) or isoleucine (I); 

the third amino acid position of the six amino acids is isoleudne (I) or leucine (L); 

the fourth amino acid position of the six amino acids is valine (V) or leucine (L); 

the fifth amino acid position of the six amino acids is glutamine (Q), leucine (L) or 
lysine (K); and 

the sixth amino acid position of the six amino acids is phenylalanine (F), leucine 
(L), methionine (M) or leucine (L). 

8. The chimeric polynucleotide of claim 7. wherein the catalytic domain comprises amino 
acids residues SHIVLM (SEQ ID NO: 41) or SRIVLM (SEQ ID NO: 42). 

9. A e-cyclase enzyme comprising a catalytic domain of amino acid residues SHIVLM 
(SEQ ID NO: 41) or SRIVLM (SEQ ID NO: 42). 

10. A method for identifying an enzyme-catalyzing domain In a eukaryotic or prokaryotic 
carotenoid-synthesizing enzyme, comprising 

a) providing a first polynucleotide encoding a full-length enzyme and a second 
polynucleotide encoding a full-length enzyme, each polynucleotide being subcloned in 
tandem into a vector; 



39 



wo 02/061050 PCTAJS02/00667 

b) providing a first primer for hybridizing to tlie first polynucleotide and a second 
primer for hybridizing to the second polynucleotide; 

c) performing an inverse polymerase chain reaction using the first and the 
second primer and the vector to obtain a construct containing a chimeric polynucleotide 
containing a 5' end of the first polynucleotide and a 3' end of the second polynucleotide; 

d) repeating steps b) and c) with a plurality of different first primers and a plurality 
of different second primers for obtaining a plurality of constructs containing different 
chimeric polynucleotides for scanning along the encoded amino acid sequence one 
amino acid at a time; 

e) transfecting a host cell with one or more of the plurality of constructs and 
growing the host cell under conditions for expressing chimeric proteins encoded by the 
chimeric polynucleotides; 

f) performing enzyme catalysis with the chimeric proteins on an enzyme-specific 
substrate in the host cell, and 

g) identifying the enzyme-catalyzing domain encoded by the chimeric proteins by 
identification of at least one caroterioid compound from the enzyme catalysis of step f). 

1 1 . The method of claim 1 0, wherein the first polynucleotide encodes an N-terminal 
portion of a lettuce enzyme and the second polynucleotide encodes a C-terminat portion 
of an Arabidopsis enzyme. 
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12. The method of claim 10. wherein the first polynucleotide encodes an N-terminal 
portion of an Arabidopsis enzyme and the second polynucleotide encodes a C-terminal 
portion of a lettuce enzyme. 

13. The method of claim 10. wherein the enzyme is s-cyclase. 

14. The method of claim 10 or 11, wherein the first primer is a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 3. SEQ ID NO: 5, SEQ ID NO: 7 and 
SEQ ID NO: 9, and the second primer is a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8 and SEQ ID NO: 10, 

15. The method of claim 10 or 12, wherein the first primer is a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15. 
SEQ ID NO: 17 and SEQ ID NO: 19, and the second primer is a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 12, SEQ ID NO: 14. SEQ ID NO: 16. 
SEQ ID NO: 18 and SEQ ID NO: 20. 

16. A method for identifying an enzyme-catalyzing domain in a eukaryotic or 
prokaryotic carotenoid-synthesizing enzyme, comprising 

a) providing a vector containing a polynucleotide encoding the full-length enzyme 
and a primer for hybridizing to the polynucleotide; 
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b) performing site-directed mutagenesis using the vector and the primer for 
obtaining a construct containing a site-directed mutant of the polynucleotide encoding 
the enzyme; 

c) transfecting a host cell with the construct and growing the host cell under 
conditions for expressing a site-directed mutant of a protein encoded by the site- 
directed mutant of the polynucleotide; 

d) allowing enzyme catalysis with the site-directed mutant of the protein on an 
enzyme-specific substrate in the host cell; and 

e) identifying the enzyme-catalyzing domain encoded by the site-directed mutant 
of the protein by identification of a carotenoid compound from the enzyme catalysis of 
step d). 

17. A method for identifying an enzyme-catalyzing domain in a eukaryotic or prol<aryotic 
carotenoid-synthesizing enzyme, comprising 

a) providing a vector containing a polynucleotide encoding the full-length enzyme 
and a primer for hybridizing to the polynucleotide; 

b) performing site-directed mutagenesis using the primer and the vector for 
obtaining a construct containing a truncated polynucleotide encoding a fragment of the 
enzyme; 

c) transfecting a host cell with the construct and growing the host cell under 
conditions for expressing a truncated protein encoded by the truncated polynucleotide; 

d) allowing enzyme catalysis with the truncated protein on an enzyme-specific 
substrate in the host cell; and 
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e) identifying the enzyme-catalyzing domain encoded by the truncated protein by 
identification of a carotenoid compound from the enzyme catalysis of step d). 

18. A method for producing 8,8-carotene in an 8,e-carotene-deficient, lycopene- 
expressing host, comprising transfecting the host with a chimeric polynucleotide 
encoding a host-specific s-cyclase enzyme containing a catalytic domain according to 
SEQ ID NO: 41 or SEQ ID NO: 42 and expressing the chimeric polynucleotide. 

19. A method for increasing 8,e-carotene in a lycopene-expressing host, comprising 
transfecting the host with a chimeric polynucleotide encoding a host-specific e-cyclase 
enzyme containing a catalytic domain according to SEQ ID NO: 41 or SEQ ID NO: 42 
and expressing the chimeric polynucleotide. 
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cDNA 



Deduced AA Sequence Product 



Arabldopsis LCYo m^PSZi; ^IVQF bxSQXS^ 458 8,v-«arotene 

Potato liCYG TWl^^lSXi '^Zl^oii DX^GZiR 312 e,v-carotene 

Tomato LCre VFhkGSU JOmJ^JOsL DXSQXR 460 8,v-«arotene 

Marigold LCYe VJ^t^FGU OJ.pG'CR 450 E,>|/-carotene 

Zietfcuce ItCYG )F^Fg£ Bl^vk 467 8,6>€arotene 



Adonis XiCYel S^SfGCCi SlIVOD pX^!m 463 e,V~andE,E- 
Arabidopsis LCYb ^S^p^^PfM ^^^Xb p^^T^ 435 p,p-caFotene 



Cunningham and Gantt 
Figure 6 
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FIGURE 8 (CONT.) 
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SEQUENCE LISTING 



<110> Cunningham, Francis x. 



<120> Methods for Determining Ring Number in carotenoids By Lycopene Epsilon 
cyclases and uses Thereof 

<130> 108172-00055 

<150> 60/261,473 
<151> 2001-01-12 



<160> 48 



<170> Patent In version 3.1 

<210> 1 

<211> 5 

<212> DNA 

<213> conserved Avail sites 



<400> 1 
ggwcc 

<210> 2 

<211> 6 

<212> DNA 

<213> conserved Bglll sites 



<400> 2 
agatct 

<210> 3 
<211> 24 
<212> DNA 
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<213> Lettuce/Arabi dopsi s 
<400> 3 

cgaagaatat ttttgttgga gaca 

<210> 4 
<211> 21 
<212> DNA 

<213> Lettuce/Arabi dopsi s 
<400> 4 

aggagacatg tttggcctca a 

<210> 5 
<211> 21 
<212> DNA 

<213> Lettuce/Arabi dopsi s 
<400> 5 

ttcaagtggc cacaatgttt c 

<210> 6 
<211> 23 
<212> DNA 

<213> Lettuce/Arabi dopsi s 
<400> 6 

aggaaaagac agagagcatt ctt 

<210> 7 

<211> 23 

<212> DNA 

<213> Lettuce/Arabi dopsi s 
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24 



21 



21 



<400> 7 

attagcacga tgtgtgatag tec 



2 



23 
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<210> 8 

<211> 21 

<212> DNA 

<213> Lettuce/Arabi dopsi s 



<400> 8 

ggataccgaa ggcattagaa g 

<210> 9 

<211> 26 

<212> DNA 

<213> Lettuce/Arabi dopsi s 



<400> 9 

gcaaatatta tcaaatccgt tgaaga 26 

<210> 10 
<211> 27 
<212> DNA 

<213> Lettuce/Arabi dopsi s 
<400> 10 

tttatacatg ttcgtcattt caccaaa 27 

<210> * 11 
<211> 19 
"<212> DNA 

<213> Arabi dopsi s/Lettuce 



<400> 11 

tagcctgttg cgggatgta 19 

<210> 12 

<211> 23 

<212> DNA 

<213> Arabi dopsi s/Lettuce 
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<400> 12 

ttcagttgtt cgatctttgt cag 23 

<210> 13 
<211> 22 
<212> DNA 

<213> Arabi dopsi s/Lettuce 
<400> 13 

tctgcgatga ctgatgcata tt 22 

<210> 14 
<211> 31 
<212> DNA 

<213> Arabi dopsi s/Lettuce 
<400> 14 

gattttaaga caagatcaat ctaaagagat g 31 

<210> 15 
<211> 26 
<212> DNA 

<213> Arabi dopsi s/Lettuce 
<400> 15 

ctgttgatct gtttggtagt ctcttc 26 

<210> 16 

<211> 26 

<212> DNA 

<213> Arabi dopsi s/Lettuce 



<400> 16 

taacatttca aaacaagcat gggaaa 26 



<210> 17 
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<211> 24 
<212> DNA 

<213> Arabi dopsi s/Lettuce 



<400> 17 

atgctctctg tcttttcctt tctg 24 

<210> 18 
<211> 26 
<212> DNA 

<213> Arabi dopsi s/Lettuce 
<400> 18 

tctttctatt cggactatca cacatc 26 

<210> 19 
<211> 25 
<212> DNA 

<213> Arabi dopsi s/Lettuce 
<400> 19 

cttcggtatc gaattgaact atgag 25 

<210> 20 

<211> 26 

<212> DNA 

<213> Arabi dopsi s/Lettuce 



<400> 20 

gaacacgtac atttttccgt actttc 26 

<210> 21 

<211> 1860 

<212> DNA 

<213> Arabi dopsi s 
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<400> 21 


araaTTagax 


tcctcttuct 


gcttgctalia 


ecu Lganaga 


acaaxaTaac 


DU 




g"tc"tt:c"tcgc 


tgtattcgaa 


arrarrxgga 


ggaggaaaat 


ggagrgrgiiT: 






aiziiticgcagc 


«^2itggcggtt 


tcaacati'Ctc 


cgticatggag 


't'tg'tcgaagg 




aaatttccag 


tggttaagag 


atacagctat 


aggaatattc 


gtttcgg ttt 


gtgtagtgtc 


240 


agagctagcg 


gcggcggaag 


ttccggtagt 


gagagttgtg 


tiagcggtgag 


agaagatttc 


300 


gctgacgaag 


aagattttgt 


gaaagctggt 


ggttctgaga 


ttctatttgt 


tcaaatgcag 


360 


cagaacaaag 


at:at:ggatga 


acagtctaag 


cttgttgata 


agttgcctcc 


t:atat:caal:t 


420 


ggtgatggtg 


ctttggatca 


tgtggttatt 


ggttgtggtc 


ctgctggttt 


agccttggct 


480 


gcagaatcag 


ctaagcttgg 


attaaaagtt 


ggactcattg 


gtccagatct 


tccttttact 


540 


aacaattacg 


gtgtttggga 


agatgaattc 


aatgatcttg 


ggctgcaaaa 


atgtattgag 


600 


catgtttgga 


gagagactat 


tgtgtatctg 


gatgatgaca 


agcctattac 


cattggccgt 


660 


gcttatggaa 


gagttagtcg 


acgtttgctc 


catgaggagc 


ttttgaggag 


gtgtgtcgag 


720 


tcaggtgtct 


cgtaccttag 


ctcgaaagtt 


gacagcataa 


cagaagcttc 


tgatggcctt 


780 


agacttgttg 


cttgtgacga 


caataacgtc 


attccctgca 


ggcttgccac 


tgttgcttct 


840 


ggagcagctt 


cgggaaagct 


cttgcaatac 


gaagttggtg 


igacctagagt 


ctgtgtgcaa 


900 


actgcatacg 


gcgtggaggt 


tgaggtggaa 


aatagtccat 


atgatccaga 


tcaaatggtt 


960 


ttcatggatt 


acagagatta 


tactaacgag 


aaagttcgga 


gcttagaagc 


tgagtatcca 


1020 


acgtttctgt 


acgccatgcc 


tatgacaaag 


tcaagactct 


tcttcgagga 


gacatgtttg 


1080 


gcctcaaaag 


atgtcatgcc 


ctttgatttg 


ctaaaaacga 


agctcatgtt: 


aiagaiitagat 


1140 


acactcggaa 


ttcgaattct 


aaagacttac 


gaagaggagt 


ggtcctatat 


cccagttggt 


1200 


ggttccttgc 


caaacaccga 


acaaaagaat 


ctcgcctttg 


gtgctgccgc 


tagcatggta 


1260 


catcccgcaa 


caggctattc 


agttgtgaga 


tctttgtctg 


aagctccaaa 


atatgcatca 


1320 


_5tcatGgcag 


agatactaag 


agaagagact 


accaaacaga 


tcaacagtaa 


tatttcaaga 


1380 


caagcttggg 


atactttatg 


gccaccagaa 


aggaaaagac 


agagagcatt 


ctttctcttt 


1440 


ggtcttgcac 


tcalzagttca 


attcgatacc 


gaaggcatta 


gaagcttctt 


ccgtactttc 


1500 


ttccgccttc 


caaaatggat 


gtggcaaggg 


tttctaggat 


caacattaac 


atcaggagat 


1560 


ctcgttctct 


ttgctttata 


catgttcgtc 


atttcaccaa 


acaatttgag 


aaaaggtctc 


1620 


atcaatcatc 


tcatctctga 


tccaaccgga 


gcaaccatga 


taaaaaccta 


tctcaaagta 


1680 


tgatttactt 


atcaactctt 


aggtttgtgt 


atatatatgt 


tgatttatct 


gaataatcga 


1740 


tcaaagaatg 


gtatgtgggt 


tactaggaag 


ttggaaacaa 


acatgtatag 


aatctaagga 


1800 


gtgatcgaaa 


tggagatgga 


aacgaaaaga 


aaaaaatcag 


tctttgtttt 


gtggttagtg 


1860 
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<210> 22 

<211> 25 

<212> DNA 

<213> Arabidopsis 



<400> 22 

caaccatggc ggtttcaaca tttcc 25 

<210> 23 

<211> 30 

<212> DNA 

<213> Arabidopsis 



<400> 23 . ->r^ 

tgtgccatgg tgagagaaga tttcgctgac 30 

<210> 24 

<211> 31 

<212> DNA 

<213> Arabidopsis 



<400> 24 

ttcccatgga gcagaacaaa gatatggatg a 31 

<210> 25 

<211> 30 

<212> DNA 

<213> Arabidopsis 

<400> 25 

aagccatgga tgaacagtct aagcttgttg 30 

<210> 26 

<211> 6 

<212> DNA 

<213> Arabidopsis 

7 



wo 02/061050 



PCT/US02/00667 



<400> 26 
ccatgg 



6 



<210> 27 

<211> 36 

<212> DNA 

<213> Arabidopsis 

<400> 27 

gatgattgat gagaccttat ctcaaattgt ttggtg 36 

<210> 28 

<211> 42 

<212> DNA 

<213> Arabidopsis 



<210> 29 

<211> 40 

<212> DNA 

<213> Arabidopsis 

<400> 29 

ggtatcgaat tgaactatgt gcgcaagacc aaagagaaag 40 

<210> 30 

<211> 40 

<212> DNA 

<213> Arabidopsis 



<400> 28 

gccttcggta tccattagca cgatgtgtga aagaccaaag ag 



42 



<400> 30 

ggtatcgaat tgaacgatat ctgcaagacc aaagagaaag 



40 



<210> 31 
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<211> 40 

<212> DNA 

<213> Arabidopsis 



<400> 31 

ggtatcgaat tgaactatgc gcgcaagacc aaagagaaag 40 

<210> 32 

<211> 40 

<212> DIMA 

<213> Arabidopsis 



<400> 32 

cgaattgaac tatgagatct agaccaaaga gaaagaatgc 40 

<210> 33 

<211> 1780 

<212> DNA 

<213> Lettuce 



<400> 33 
gaaacaaatg 


acgtgaaagt 


tcttcaaaat tgaattaatt 


gtaatcctga aaacttgatt 


60 


tgtgatagaa 


gaatcaatgg 


agtgctttgg agctcgaaac 


atgacggcaa caatggcggt 


120 


ttttacgtgc 


cctagattca 


cggactgtaa tatcaggcac 


aaattttcgt tactgaaaca 


180 


acgaagattt 


actaatttat 


cagcatcgtc ttcgttgcgt 


caaattaagt gcagcgctaa 


240 


.^agcgatrcgt 


tgtgtagtgg 


ataaacaagg gatttccgta 


gcagacgaag aagattatgt 


300 


gaaggccggt 


ggatcggagc 


tgttttttgt tcaaatgcag 


cggactaagt ccatggaaag 


360 


ccagtctaaa 


ctttccgaaa 


agctagcaca gataccaatt 


ggaaattgca tacttgatct 


420 


ggttgtaatc 


ggttgtggcc 


ctgctggcct tgctcttgct 


gcagagtcag ccaaactagg 


480 


gttgaacgtt 


ggactcattg 


gccctgatct tccttttaca 


aacaattatg gtgtttggca 


540 


ggatgaattt 


ataggtcttg 


gacttgaagg atgcattgaa 


cattcttgga aagatactct 


600 


tgtatacctt 


gatgatgctg 


atcccatccg cataggtcgt 


gcatatggca gagttcatcg 


660 


tgatttactt 


catgaagagt 


tgttaagaag gtgtgtggaa 


tcaggtgttt catatctaag 


720 


ctccaaagta 


gaaagaatca 


ctgaagctcc aaatggctat 


agtctcattg aatgtgaagg 


780 
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caatatcacc 


attccatgca 


ggcttgctac 


tgttgcatca 


ggggcagctt 


cagggaaatt 


840 


tctggagtat 


gaacttgggg 


gtccccgtgt 


ttgtgtccaa 


acagcttatg 


gtatagaggt 


900 


tgaggttgaa 


aacaacccct 


atgatccaga tctaatggtg ttcatggatt 


atagagactt 


960 


ctcaaaacat 


aaaccggaat 


ctttagaagc 


aaaatatccg 


actttcctct 


atgtcatggc 


1020 


catgtctcca 


acaaaaatat 


tcttcgagga 


aacttgttta 


gcttcaagag 


aagccatgcc 


1080 


tttcaatctt 


ctaaagtcca 


aactcatgtc 


acgattaaag 


gcaatgggta 


tccgaataac 


1140 


aagaacgtac 


gaagaggaat 


ggtcgtatat 


ccccgtaggt 


ggatcgttac 


ctaatacaga 


1200 


acaaaagaat 


ctcgcatttg 


gtgctgcagc 


tagtatggtg 


caccctgcca 


cagggtattc 


1260 


agttgttcga 


tctttgtcag 


aagcticctaa 


tiiatigcagca 


gtcattgcta 


agattttaag 


1320 


acaagatcaa 


tctaaagaga 


tgatttctct 


tggaaaatac 


actaacattt 


caaaacaagc 


1380 


atgggaaaca 


ttgtggccac 


ttgaaaggaa 


aagacagcga 


gccttctttc 


tattcggact 


1440 




ycyc Lad-Cgy 


atctagaggg 


aacacgtaca 


tttttccgta 


C LC LXuCg 




tttgcccaaa 


tggatgtggt 


ggggattttt 


ggggtcttct 


ttatcttcaa 


cggatttgat 


1560 


aatatttgcg 


ctttatatgt 


ttgtgatagc 


acctcacagc 


ttgagaatgg 


aactggttag 


1620 


acatctactt 


tctgatccga 


caggggcaac 


tatggtaaaa 


gcatatctca 


ctatatagat 


1680 


ttagattata 


taaataatac 


ccatatcttg 


catatatata 


agccttattt 


atttcttttg 


1740 


tatccttaca 


acaacatact 


cgttaattat 


atgtttttta 






1780 



<210> 34 

<211> 6 

<212> DNA 

<213> Lettuce 

<400> 34 
ccatgg 

<210> 35 

<211> 6 

<212> DNA 

<213> Lettuce 



<400> 35 
ccatgg 

<210> 36 
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<211> 36 

<212> DNA 

<213> Lettuce 

<400> 36 

atgtctaacc agttcctatc tcaagctgtg aggtgc 

<210> 37 

<211> 45 

<212> DNA 

<213> Lettuce 

<400> 37 

ccctctagat ccattagcac gatgagtgat agtccgaata gaaag 

<210> 38 

<211> 44 

<212> DNA 

<213> Lettuce 

<400> 38 

cctctagatc cattagcacg atatctgata gtccgaatag aaag 

<210> 39 

<211> 45 

<212> DNA 

<213> Lettuce 

<400> 39 

ctagatccat tagcacgatt ctcgagagtc cgaatagaaa gaagg 

<210> 40 

<211> 35 

<212> DNA 

<213> Lettuce 
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<400> 40 

gttccctcta gatccatttg cacgatgtgt gatag 35 

<210> 41 

<211> 6 

<212> PRT 

<213> Lettuce 



<400> 41 

Ser His He Val Leu Met 
1 5 



<210> 42 

<211> 6 

<212> PRT 

<213> Lettuce 



<400> 42 

Ser Arg lie val Leu Met 
1 5 



43 
6 

PRT 

Arabidopsis 

<400> 43 

Ala Leu He Va1 
1 



<210> 
<211> 
<212> 
<213> 



<210> 
<211> 
<212> 
<213> 



44 
6 

PRT 
Potato 



Gin Phe 
5 



<400> 44 
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Ala Leu lie Leu Gin Leu 
1 5 



<210> 45 

<211> 6 

<212> PRT 

<213> Tomato 



<400> 45 ^ 

Ala Leu l1e Leu Gin Leu 
1 5 



<210> 46 

<211> 6 

<212> PRT 

<213> Marigold 

<400> 46 

Ala Leu lie val Gin Met 
1 5 

<210> 47 

<211> 6 

<212> PRT 

<213> Adonis e- cyclase 

<400> 47 

Glu Leu xle val Gin Leu 
1 5 

<210> 48 

<211> 6 

<212> PRT 

<213> Arabidopsis 



<400> 48 
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Asp He Leu Leu Lvs Leu 
1 5 
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